A New Upper Bound on the Capacity of a Class of 

Primitive Relay Channels 

Ravi Tandon Sennur Ulukus 
Department of Electrical and Computer Engineering 
University of Maryland, College Park, MD 20742 
ravit@ umd. edu ulukus @ umd. edu 



Abstract — We obtain a new upper bound on the capacity of 
a class of discrete memoryless relay channels. For this class of 
relay channels, the relay observes an i.i.d. sequence T, which is 
independent of the channel input X. The channel is described 
by a set of probability transition functions p(y\x,t) for all 
(x, t, y) € X x T x y. Furthermore, a noiseless link of finite 
capacity R Q exists from the relay to the receiver. Although the 
capacity for these channels is not known in general, the capacity 
of a subclass of these channels, namely when T = g(X,Y), for 
some deterministic function g, was obtained in [1] and it was 
shown to be equal to the cut-set bound. Another instance where 
the capacity was obtained was in [2], where the channel output 
Y can be written as Y = X © Z, where ffi denotes modulo- 
m addition, Z is independent of X, \X\ = \y\ = m, and T is 
some stochastic function of Z. The compress-and-forward (CAF) 
achievability scheme [3] was shown to be capacity achieving in 
both cases. 

Using our upper bound we recover the capacity results of 
[1] and [2]. We also obtain the capacity of a class of channels 
which does not fall into either of the classes studied in [1] 
and [2]. For this class of channels, CAF scheme is shown to 
be optimal but capacity is strictly less than the cut-set bound 
for certain values of Ro. We further illustrate the usefulness of 
our bound by evaluating it for a particular relay channel with 
binary multiplicative states and binary additive noise for which 
the channel is given as Y = TX + N. We show that our upper 
bound is strictly better than the cut-set upper bound for certain 
values of Ro but it lies strictly above the rates yielded by the 
CAF achievability scheme. 



I. Introduction 

The relay channel is one of the simplest, yet arguably among 
the least understood multi-user channels in information theory. 
A special class of discrete memoryless relay channel is the 
primitive relay channel [1]. For this class, the channel is 
defined by a channel input X, a channel output Y and a relay 
output T, and a set of probability functions p(y,t\x) for all 
x G X. In this setting, the relay does not have an explicit coded 
input for the channel. Moreover, it is also assumed that there is 
an orthogonal link of finite capacity Ro, from the relay to the 
receiver. Zhang [4] considered this relay channel and obtained 
a partial converse for a degraded case. For a comprehensive 
survey on related work on primitive relay channels, see [5]. 

Recently, Kim [1] established the capacity of a class of 
semi-deterministic primitive relay channels, for which the 
relay output T can be expressed as a deterministic function 
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of the channel input X and the channel output Y, i.e., T = 
g(Y,X). The cut-set upper bound [6] was shown to be the 
capacity through an algebraic reduction of the compress-and- 
forward (CAF) achievable rate [3] to the cut-set upper bound. 
This was the first instance where the CAF achievability scheme 
was shown to be capacity achieving for any relay channel. 

In this paper, we consider a subclass of the primitive relay 
channel. In this subclass, the relay observes an i.i.d. sequence 
T which is independent of the channel input X and the channel 
output Y is given by the set of probability transition functions 
p(y\x, t) for all (x, t,y) <E X xTxy. Alternatively, this chan- 
nel can be interpreted as a state dependent discrete memoryless 
channel with rate-limited state information available at the 
receiver (Figure 1). This channel was also studied in [7] with 
various modifications regarding the rate-limited knowledge of 
the channel state T at the transmitter and the receiver. A 
CAF achievability scheme for this state dependent channel 
was given by Ahlswede and Han in [8] and it was conjectured 
to be the capacity for this class of channels. In fact, the same 
achievable rates for this channel were obtained in [7] and can 
also be obtained via Theorem 6 of [3]. 

It follows from the result of [1] that this conjecture is 
true for the subclass when the state T can be expressed as 
a deterministic function of X and Y, i.e., T = g(X,Y). 
An example of such a channel is the case when X, T and 
Y are all binary, T ~ Ber(<5) and independent of X, and 
the channel is given by F = X © T, where © denotes 
modulo-2 addition. Note that, in this case, T is a deterministic 
function of X and Y, since T = X © Y. A capacity result 
following up on the aforementioned modulo-additive noise 
channel was obtained in [2], where it was assumed that the 
receiver observes Y — X © Z and the relay observes a noisy 
version of the forward noise, i.e., T = Z © Z. Clearly, if 
Z = 0, then this channel reduces to the class studied in [1]. 
However, when Z ^ 0, T cannot be written as a deterministic 
function of X and Y, and this modulo-additive class lies 
outside of the class of channels considered in [1]. By proving 
a converse, it was shown in [2] that CAF scheme is capacity 
achieving for this modulo-additive case. The remarkable fact 
was that the capacity was shown to be strictly less than the 
cut-set upper bound for certain values of Rq. However, it is 
worth noting that the converse proved in [2] relied heavily on 
the modulo-additive nature of the forward channel. 

In this paper, we obtain a new upper bound on the capacity 
of the state-dependent discrete memoryless channel, where the 
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Figure 1: Channel with rate-limited state information. 

states are i.i.d. and the state information is available to the 
receiver through a noiseless link of finite capacity Rq. Our 
upper bound serves a dual purpose. Firstly, using our upper 
bound, we recover the capacity results obtained in [1] for the 
case where T = g(X,Y) and the capacity result obtained in 
[2] for the modulo -additive noise case. Secondly, we confirm 
the validity of the conjecture due to Ahlswede-Han [8] for 
another class of channels which does not fall into any of the 
cases considered in [1] and [2]. 

To further illustrate the application of our upper bound, we 
consider a channel where X, T, N are binary and Y is ternary 
and the channel is given by Y = TX + N, i.e., when the 
state sequence is binary and multiplicative and there is additive 
binary noise at the receiver. This channel can be interpreted 
as the discrete analogue of a fast fading channel with fade 
information available in a rate-limited fashion at the receiver. 
This channel does not fall into any class for which capacity 
is known. We evaluate our upper bound for this channel and 
show that it is strictly less than the cut-set bound for certain 
values of R although our upper bound is strictly larger than 
the rates yielded by the CAF scheme. 

II. Relay Channel Model 

We consider a relay channel with finite input alphabet X, 
finite output alphabet y and finite relay output alphabet T. 
Moreover, the relay observes an i.i.d. state sequence T n G T n 
with some given probability distribution p(t). The relay chan- 
nel is described by the set of transition probabilities p(y\x,i) 
which are defined for all (x, t, y) 6 X x T x y. Furthermore, 
there is a finite-capacity noiseless link of capacity Rq from the 
relay to the receiver. This relay channel can also be thought 
of as a state-dependent single-user channel with rate-limited 
state information available at the receiver (see Figure 1). 

An (n,M,P e ) code for this relay channel consists of the 
set of integers Ai = {1, 2, . . . , M} and the following: 
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where f t is the transmitter encoding function, f r is the relay 
encoding function and g is the decoding function. Furthermore, 
as the relay to receiver link is of limited capacity Rq, we have 



L < 2 nR ° 



(2) 



For a distribution p(w) on Ai, the joint probability distribution 

on Ai x X n x T" x y n is given as 

n n 

p(w,x n ,t n ,y n ) =p(w)p(x n \w)l[p(t i )l[p(y i \x i ,t i ) (3) 



For a uniform distribution p(w) on Ai, the average probability 
of error is given as, P e = Pr(<p(Y n , f r (T n )) ^ W). A rate R 
is achievable if for any e > and all n sufficiently large, there 
exists an (n,M,P e ) code such that P e < e and M > 2 nR . 
The capacity of the relay channel is the supremum of the set 
of all achievable rates. 

III. A New Upper Bound on the Capacity 

We will denote by U n as the output of the finite capacity 
link Rq, i.e., U n = f r (T n ). We will now obtain an upper 
bound on the rate as follows, 



n.R = H(W) 

= I(W; Y n , U n ) + H(W\Y n , U n ) 
<I(W;Y n ,U n )+ne n 
< I(X n ;Y n ,U n ) +ne n 
= I(X n ;Y n \U n ) + ne n 



nc r , 



Filthy*- 1 ; 

i=l 
n 

J2 \H(Y i \U n ,Y i - 1 )-H(Y i \U n ,Y i - 1 ,X n ] 



2 = 1 



(4) 
(5) 
(6) 
(7) 
(8) 

(9) 



(10) 



< [H(Y)-H(Y\U n ,Y l -\X n ) 

i=l 


+ ne 


(11) 


n 

< J2 [h(Y 1 ) - H(Y i \U n ,T i - 1 ,Y i - 1 ,X n ) 


+ ne n 


1=1 




(12) 


n 

= J2 [-ff(F) - H(Y i \U n ,T i ~ 1 ,X n ) 

i = l 


+ ne 


(13) 


n 

= J2 [H(Yi) -H{Y j \U n ,T i ~ 1 ,X i ) 

i=l 


+ ne n 


(14) 


n 

= J2 I (X l ,U n ,T i - 1 ;Y l )+ne n 

i=l 




(15) 


n 

= J2 I (X l ,V l ;Yi)+ne n 

i=l 




(16) 


= nI(X, V; Y) + ne n 




(17) 



where (|6]l follows by Fano's inequality [6], (0 follows from 
the data processing inequality, $Q follows from the fact that 
X n is independent of T n and is hence independent of U n , 
(fTTT i follows from the fact that conditioning reduces entropy 
and hence we upper bound by dropping (U n , Y 1 ' 1 ) from the 
first term. Next, (fT2l follows by adding in the conditional 
entropy in the second term and obtaining an upper bound, ( fT3l 
follows from the memoryless property of the channel, i.e., 
given (X 4_1 ,T i_1 ), the channel output y l_1 is independent 
of everything else and ( fT4l follows from the following Markov 
chain, X n \ X. t -> (X t , U",^- 1 ) -> Y,. The proof of this 
Markov chain is given at the beginning of next page. Finally, 
(fTol l follows by defining Vi = (U n ,T' l ~ 1 ), and we introduce 
a random variable Q, uniform on {1, 2, . . . , n} to define X = 
(X h Q), Y = (Y i} Q) and V = (Vi,Q) to arrive at O- 



We obtain the Markov chain by showing the following, 

Pr(y , X-*\ Xi , U», T*" 1 ) = Pr( ^;^!;^J^" 1) (18) 



_ E u P(U)Pr(Yj,X-\ Xj, 17",!*-%) 

Pr(Xi,U n ,T*-i) 
= E t , P(U)Pr(Xj, U n , fjjft^ X-'lXj, U, U n , T'- 1 ) 

_ gtj P(*i)Pr(^». T^^tQPrCX-^j, fj, CP, T^Pr^^ IP\T^\X^) 

Pf(jri,^,T*-i) 

= Pr(X-'|X t ) *' ' ' " ' ! (23) 

= Vx{X- i \X i )Y J ^{U\X u C/ n ,T J - 1 )Pr(K i |X i , U n ,T*- 1 ,t i ) (24) 
U 

= Pr^-^X^Pr^lX,-,^,^- 1 ) (25) 
where we have defined X" 1 = (X x , ■ ■ ■ , X^ x ,X i+ i, X n ). 
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In addition to ( TT7V we also need the following trivial upper 
bound on the rate, 
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where (f26l > follows by Fano's inequality, d27l i follows be- 
cause X™ is independent of T n , ( f30b follows by dropping 
(y , " 1 ,T n \ 2}) from the conditioning in the first term, 
d3Tl i follows from the memoryless property of the channel, 
i.e., given (Xj,2}), the channel output y is independent of 
everything else. 

We now obtain a bound on the allowable distributions of 
the involved random variables. Using the fact that the side 
information is limited by the rate Rq, we have that 



nR > I(T n ;U n ) 
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= nI(T;V) 

where (f36b follows from the fact that 2} are i.i.d. 

Combining ( fTTb . d33b and d37l i. we have an upper bound on 
the capacity of the relay channel as 

UB = sup min{/(X, V; Y), I(X; Y\T)} 
s.t. Rq > I(T; V) 

over p(x)p(t)p(v\t) (38) 

where the supremum can be restricted over those V such that 
|V|<|T| + 2. 

IV. Comparison with the Cut-set Bound 

The best known upper bound for the relay channel is the 
cut-set bound [6], which reduces for the relay channel in 
consideration to [1], [5] 

CS = maxmin{/(X; Y) + R , 1(X; Y\T)} (39) 

p(x) 

On comparing with the cut-set bound, it can be observed that 
our bound differs from the cut-set bound in the multiple access 
cut. We will show next that our upper bound is in general 
smaller than the cut-set bound. 

We start by upper bounding the expression I(X, V; Y) as 
follows, 

I(X, V; Y) = I(X; Y) + I(V; Y\X) (40) 
= I(X;Y)+H(V\X)-H(V\Y,X) (41) 
= I(X;Y)+H(V)-H(V\Y,X) (42) 
< I(X:Y) + H(V) - H(V\T,Y,X) (43) 
= I(X;Y)+H(V)-H(V\T) (44) 



= I(X;Y)+I(T;V) 
<I(X;Y)+R 



(45) 
(46) 



where d42l follows from the fact that V is independent of X, 
d43l follows from the fact that conditioning reduces entropy, 
d44l follows from the Markov chain (X, Y) — > T — > V and 
(g5j follows by using the fact that J(T; F) < i? . Using d46t 
and d33l ). we have the following 



Wfi < maxmin{/(X;r) 

P(as) 



i? ,/(X;F|T)} (47) 



Thus, our upper bound is in general smaller than the cut-set 
bound given in (1391 . It was shown in [1] that the cut-set bound 
is tight for the case when T = g(X, Y) and is achieved by the 
CAF achievability scheme. Note the fact that for this special 
subclass, the inequality in d43l is in fact an equality and our 
bound exactly equals the cut-set bound. 

V. Recovering the Capacity of Modulo-Additive 
Relay Channel 

A specific modulo-additive relay channel was considered in 
[2] for which the channel is given as, 



Y = X 
T = Zi 



) Z 

z 



(48) 
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where X, Y, T, Z and Z are all binary and Z ~ Ber(<5), 
Z ~ Bei(S). Clearly this channel does not fall into the class 
of channels studied in [1], where T can be written as a 
deterministic function of X and Y. It was shown that the 
capacity of this channel is given by [2, Theorem 1] 



C= max 1-H(Z\V) 

p(v\t):I(T;V)<R 



(50) 



We will show that our bound is equal to the capacity for this 
class of channels. First, note that 

I(X,V;Y)=H(Y)-H(Y\X,V) (51) 
= H(Y) - H(Z\V) (52) 
<1-H(Z\V) (53) 

where d53l follows by the fact that the entropy of a binary 
random variable is upper bounded by 1. Next, consider the 
other cut, 

I(X; Y\T) = H(Y\T) - H(Y\X, T) (54) 
= H(Y\T) - H(Z\T) (55) 
< 1 - H{Z\T) (56) 

We note that d53l and (TS6b are achieved with equality for a 
uniform X. Moreover, from d53l and d56l l. it can be observed 
that the bound I(X; Y\T) is redundant since V — > T — > Z 
implies H(Z\T) < H(Z\V). Hence, our upper bound reduces 
to 
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p(v\t):I(T;V)<R 



H(Z\Y) 



(57) 



We should remark that the converse obtained in [2] for this 
channel utilized the modulo-additive nature of the channel. 
For such a channel, a uniform distribution on X makes the 



channel output Y independent of noise Z, thereby making the 
proceedings in the converse easier. Our upper bound does not 
rely on the nature of the channel and holds for any p(y\x,t). 

We have thus shown that for all the cases where the capacity 
is established, our bound is tight. To illustrate the usefulness 
of our bound, we will consider a channel which does not fall 
into any of these classes. 

VI. Capacity Result for a Symmetric Binary 
Erasure Channel with Two States 

We will show that for a particular symmetric binary input 
erasure channel with two states, our upper bound yields the 
capacity which turns out to be strictly less than the cut-set 
bound. The state T is binary with Pr(T = 0) = a. The channel 
input X is binary and channel output Y is ternary. For channel 
states T = 0,1, the transition matrices p(y\x,t) are given as 
(Figure 2), 
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It should be noted that this class of channels does not fall 
into the class of channels considered in [1] since T cannot be 
obtained as a deterministic function of X and Y. Moreover, the 
channel output Y cannot be expressed in the form as Y = X® 
Z, for some p(t\z), where © is modulo-2 addition, since the 
cardinality of Y is different from the cardinality of X. Hence, 
the converse technique developed in [2] for modulo-additive 
relay channels does not apply to this channel. However, our 
upper bound holds for any p(y\x,t). We begin by evaluating 
the achievable rates given by the CAF scheme, 

C > sup I{X;Y\V) 

s.t. I(T;V\Y) < Ro 

for some p(x, t, v) = p(x)p(t)p(v\t) (58) 
Throughout this paper, we denote the entropy function as 



,s k . 



= ~Y^ SilOg(Si) 



(59) 



where Si > 0, i = 1, . . . , k and Sj = 1. We will denote 
the binary entropy function as h(s). We first define Pr(X = 
0) = p and obtain the involved probabilities, 
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Figure 2: A symmetric binary erasure channel with two 
states. 



a uniform distribution on X, 

I(X;Y\V) = H(Y\V) - H(Y\X,V) (82) 
= h{e) + e-H(U\V) (83) 

Furthermore, for uniform X, we have 7(T; U|F) = 7(T; V), 
thus the constraint in d58l simplifies to I(T; V) < Ro. For 
simplicity, define the set 

£( 7 ) = {p(t;|i) : H(T\V) > 7 ; V T (7} (84) 

Using ( f83l > and d84l i. we obtain a lower bound on the capacity 



where we have defined 

a*b = a(l-b) + b(l-a) (69) 
Furthermore, we also note the following inequality, 

(a, b, c) = ( a , b, c) + (c, b, a) (70) 



a + c , a + c 
b 



(71) 
(72) 



2 ' ' 2 
= h(b) + l-b 

Using this fact, we have 

H(Y) = /i (3) (e(a * p), 1 - e, e(l - a * p)) (73) 

< fc( e ) + e (74) 

Also, a uniform distribution on X, yields the maximum 
entropy for Y, and makes Y and T independent. Note that 
the maximum entropy of Y in this case is h(e) + e which is 
strictly less than log(3) for all e 6 [0, 1]. Hence, for a uniform 
X, we have 



H(Y\V)=H(Y) 

= h(e) + e 

We also define, 

r] v =Pr(T = 1\V = v), v = l,. 
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Using this definition, we can write H(Y\X, V) for any distri- 
bution p(x) on X as follows, 

H{Y\X, V) = J2p(v) J2p( x ) H ( Y \ X = x,V = v) (78) 

V X 

= ^2 P (v)h& (r, v e, 1 - e, (1 - f*,)e) (79) 

V 

= H{U\V) (80) 

where we have defined a random variable U with \U\ = 3 and 
expressed as a stochastic matrix W which is given as 
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Thus, H(Y\X, V) is invariant to the distribution of X. More- 
over, by construction, the random variables (T, U, V) satisfy 
the Markov chain V -> T -> U. 

We now return to the evaluation of the rates given by the 
CAF scheme given in (l58l . Using ( |76*] | and (l80l , we have for 



C>h(e) + e- inf ff(^|U) (85) 

P (»|t)e£(Mo)^„) 

We now evaluate our upper bound. Using the following fact, 

xmn(I(X, V; Y), I(X; Y\T)) < I(X, V; Y) (86) 

we obtain a weaker version of our upper bound in d38l as 

C < sup I(X,V:Y) (87) 

= sup(iT(F) — H(Y\X, V)) (88) 

< Bup(fc(e) + e - H(Y\X, V)) (89) 

= h(e) + e-m£H(Y\X,V) (90) 

= h(e) + e- inf H(U\V) (91) 

P(v\t)ec(h( a )-R ) 

where d89l follows from ( l74l i. and the sup in (f8Tb-d89b is taken 
over all p(x) and those p(v\t) which satisfy I(T; V) < Rq. 
Hence, from ( f85l > and d9Tb , the capacity is given by 



C = h(e) 



e- inf H(U\V) (92) 

p(v\t)£C(h(a)-R ) 



We will now explicitly evaluate the capacity expression ob- 
tained in d92l ) and compare it with the cut-set bound. For 
this purpose, we need a result on the conditional entropy 
of dependent random variables [9]. Let T, U be a pair of 
dependent random variables with a joint distribution p(t,u). 
For < 7 < H(T), define the function G( 7 ) as the infimum 
of H(U\V), with respect to all discrete random variables V 
such that H(T\V) = 7 and the random variables V and U 
are conditionally independent given T. For the case when T 
is binary and p(u\t), expressed as a stochastic matrix W, takes 
the form in (HTJ, we have from [9], 



G( 7 )= inf H(U\V) 
= h(e) + e 7 



(93) 
(94) 



We will use this result from [9] in explicitly evaluating the 
capacity in d92l ). First note that, if R > h(a), then 

G{h{a) - Ro) = G(0) = h(e) (95) 

whereas, if Rq < h(a), then 

G{h{a) - Rq) = h(e) + e(h(a) - Rq) (96) 

Using (|95]> and d96l ), the capacity expression in j92l evaluates 
to, 



C(R 



e, Rq > h(a) 

e(l - h(a)) + eRo, R < h(a) 



(97) 



which can be written in a compact form as, 

C{R ) = min(e(l - h(a)) + eR , e) (98) 

The cut-set bound is obtained by evaluating (1391 for the 
channel in consideration. Evaluation of the cut-set bound is 
straightforward by noting that I{X;Y) and I(X;Y\T) are 
both maximized by a uniform p(x). For a uniform distribution 
on X, we have the following equalities, 

I(X;Y) = e(l-h(a)) (99) 
I(X;Y\T) = e (100) 

Hence, the cut-set bound is given as, 

CS(Ro) = min(e(l - h{a)) + R ,e) (101) 

The difference between the capacity and the cut-set bound 
is evident from the first term in the min operation, i.e., 
the capacity expression in d98l l has an cRq appearing in the 
minimum, as opposed to Rq appearing in the cut-set bound at 
the corresponding place in (I10U . The cut-set bound and the 
capacity are shown in Figure 3 as functions of Rq for a = 0.3 
and e = 0.4. 

In conclusion, for this channel which does not fall into the 
classes of channels studied in [1] and [2], our upper bound 
equals the CAF achievable rate, thus yielding the capacity, 
which is strictly less than the cut-set bound for Rq < h(a). 



VII. A Channel with Binary Multiplicative State 
and Binary Additive Noise 

We will evaluate our upper bound and compare it with the 
cut-set bound for the case when X, T and N are binary and 
the channel is given as, 



Y = TX + N 



(102) 



The channel output Y takes values in the set {0,1,2}. The 
random variables T and N are distributed as T ~ Ber(a) and 
N ~ Ber(<5). This relay channel does not fall into the subclass 
of channels considered in [ 1 ] . Moreover, the converse obtained 
in [2] does not apply to this channel since the output cannot 
be written as a modulo-sum. 

To evaluate our upper bound, let us define 

Pr(X = l)=p (103) 
Pr(T = 1) = a (104) 
Pr(N = 1)=5 (105) 

We then obtain H(Y) as follows 

H(Y) = h^(P Y (0),P Y (l),PY(2)) (106) 

where 

Py(0)=p(l-a)(l-5) + (l-p)(l-5) (107) 

Py(l) = (1 -p)S + p[(l -a)S + a(l -5)} (108) 

P Y (2)=paS (109) 
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Figure 3: Capacity of the binary symmetric erasure channel 
for a = 0.3 and e = 0.4. 



and H(Y\X) is obtained as, 

H{Y\X) = (1 -p)H(N) +pH(T + N) (110) 

= (1 - p)h{5) + p/i (3) ((l - a)(l - 5), a * S, aS) 

(111) 

The broadcast cut is obtained as, 

I(X;Y\T) = H(Y\T) — H(Y\X,T) (112) 
= (1 - a)h(S) + ah (3) ((l -p)(l - S),p* 5,pS) 
-h{6) (113) 

The cut-set bound is given by, 

CS = maxmin{/(X;y) + R Q ,I(X;Y\T)} (114) 
p 

We now evaluate our bound by first considering, 

I(X,V;Y) = H(Y) - H{Y\X,V) (115) 

We have already evaluated H(Y) in (|106b . Consider 

H(Y\X, V): 

H(Y\X,V) = Px{x)P v (v)H(Y\X = x, V = v) 

(116) 

= ]T Pv{v) [(1 - p)H(Y\X = 0,V = v) 

V 

+ P H(Y\X = 1, V = v)\ (117) 
= E P V^)[(1-P)#(A0 

V 

+ pH(T + N\V = v)] (118) 
= p v(v) [(1 - p)h(S) + P H{T + N\V = v)] 

(119) 

= J2P v (v)[{l - p)h{5) + P H(W\V = v)] 

(120) 



= (1 -p)h(S) +pH(W\V) (121) 

where we have defined another random variable W as follows, 

W = T + N (122) 

We are interested in lower bounding H(W\V). We also know 
that any permissible conditional distribution p(v\t) satisfies 
the constraint I(T; V) < Rq. Using this, we also have the 
following, 

H(T\V) > h{a) - R (123) 

Let us also define, 

P Tlv (T = l\V = v) = th, v e 1, . . . , |V| (124) 
We now return to calculating H{W\V) 

?r{W = w\V = v) ^J2 P T\v(tHPw\T,vHt^) (125) 
t 

= {l-r) v )P(w\T = 0,V = v) 

+ r) v P(w\T = l,V = v) (126) 

Since the random variable W takes values in the set {0, 1, 2}, 
we obtain, 

Pr(W = 0\V = v) = (1 - r) v )(l - 5) (127) 
Pr(W = 1\V = v) = r] v * 8 (128) 
Pr(W = 2\V = v)= 7] v 8 (129) 

We finally obtain, 

H(W\V) = Pv(v)h^((l - r) v )(l - 5),r) v * S, rj v 5) 

(130) 

For the special case when the additive noise is N ~ Ber(l/2), 
the above expression simplifies to 

'(l-r] v ) 1 T), 



H{W\V) = ^Pv{v)h {i) 



2 ' 2' 2 



= -H(T\V) + l 

> - (h(a) - Rq) + 1 



(131) 

(132) 

(133) 
(134) 



where ( 1134b follows from (1123b . Substituting ( 1134b in (1121b 
we obtain 



H(Y\X, V) = (1 - p)h(S) + pH(W\V) 



(135) 



>(l-p)h(6)+p l-(h(a)-Ro) + l 



(136) 



Continuing from dl 15b , we obtain an upper bound on 
I(X, V; Y) as follows, 



I(X,V;Y) = H(Y) - H(Y\X,V) 



(137) 



< H(Y)-l-^(h(a)-R ) (138) 



Moreover, the first term appearing in the cut-set bound sim- 
plifies to 



I(X; Y) + R Q = H(Y) - H(Y\X) 



Rn 



(139) 



= H(Y) - 1 
We thus obtain our upper bound as, 



^h(a) + R (140) 



UB = max min 

pe[o,i] 



H(Y) - 1 - + pR , 1(X; Y\T) 



whereas the cut-set bound is, 



CS = max min 

pe[o,i] 



H(Y) - 1 - |M«) 



(141) 

R 0l I(X;Y\T) 

(142) 



The difference between the cut-set bound and our upper bound 
is evident from the first term in the min operation, i.e., our 
upper bound has a pRo term in (1141b . as opposed to i?o at the 
corresponding place in (11421) . 

Both these bounds along with the CAF rate are illustrated 
in Figure 4 as a function of i?o for the case when a = 1/2 
and 5 = 1/2. We should remark here that although our bound 
is strictly smaller than the cut-set bound for certain values 
of i?o, it is strictly larger than the rates given by the CAF 
scheme. Here, the CAF rates are evaluated by restricting V 
to be binary, i.e., by considering all conditional distributions 
p(v\t), such that, |V| = 2. Therefore, the CAF rates plotted 
in Figure 4 are potentially suboptimal and can be potentially 
improved upon by increasing the cardinality of V. 



VIII. Discussion 

Let us recall our upper bound obtained in 

UB = supmin{/(X, V; Y), I(X; Y\T)} 
s.t. i? > I(T; V) 

over p(x)p(t)p(v\t) (143) 

Using the fact that 

min(/(X, V; Y), I(X; Y\T)) < I(X, V; Y) (144) 

and observing that 

I(X,V;Y) = I(V;Y) + I(X;Y\V) (145) 

it can be noted that our upper bound in (1143b can be further 
upper bounded as 

C <sup/(U;r) +I(X;Y\V) (146) 
s.t. I(T; V) < Rq (147) 
for some p(x)p(v\t) (148) 

On the other hand, the capacity is always lower bounded by 
the CAF rate, 



C> sup I(X;Y\V) 

s.t. I(T;V\Y) < i? 
for some p(x)p(v\t) 



(149) 
(150) 
(151) 
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Figure 4: Comparison of our upper bound with the cut-set 
bound when T ~ Ber(l/2) and N ~ Ber(l/2). 

Now using the following fact, 

7(T; V\Y) = H(V\Y) — H(V\T) (152) 
= 7(T; V) - I(V; Y) (153) 

we can rewrite the CAF lower bound on the capacity as 

C >sixpI(X;Y\V) (154) 
s.t. 7(T; V) - I(V; Y) < R Q (155) 
for some p(x)p(v\t) (156) 

We can see that the CAF lower bound on the capacity involves 
taking a supremum of I(X;Y\V) subject to the constraint 
7(T; V) — I(V; Y) < Ro whereas our upper bound involves 
taking a supremum of a larger quantity I(V; Y) + I(X; Y\V) 
subject to a stricter constraint 7(T; V) < Ro- 

Although these two maximization problems are different, 
for the class of channels for which capacity was obtained, 
at the capacity achieving input distribution p(x), we had 
I(V; Y) = 0. Moreover, the same input distribution p(x) 
yielded the maximum for both maximization problems. Thus, 
for the class of channels considered in Section VI, these two 
maximization problems are equivalent. This observation yields 
a heuristic explanation as to why we were able to obtain the 
capacity results for these classes of channels. 

IX. A New Lower Bound on Critical Rq 

In [10], Cover posed a slightly different problem regarding 
the general primitive relay channel. Considering the capacity 
as a function of Ro, i.e., C(Ro), first observe the following 
facts, 

C(0) = supI(X;Y) (157) 

p(x) 

C(oo) = sup7(X;F|T) (158) 

p{x) 

Moreover, C{Ro) is a nondecreasing function of Rq. Cover 
posed the following question in [10]: what is the smallest value 
of Ro, say R^, for which C(Rq) = C(oo)? As an application 



of our upper bound, we implicitly provide a new lower bound 
on i?Q for the class of primitive relay channels studied in this 
paper. 

For the class of channels considered in Section VI, we 
obtained the capacity. As a consequence, we can explicitly 
characterize R^ for this class of channels as h(a). Further- 
more, for the class of channels considered in Section VII, our 
upper bound on the capacity yields an improved lower bound 
on Rq than the one provided by the cut-set bound, which is 
clearly evident in Figure 4. 

X. Conclusions 

We obtained a new upper bound for a class of primitive relay 
channels. The primitive relay channel studied in this paper can 
also be considered as a state-dependent discrete memoryless 
channel, with rate-limited state information available at the 
receiver and no state information available at the transmitter. 

Using our upper bound, we first recover all previously 
known capacity results for such channels. Furthermore, we 
explicitly characterize the capacity of a new subclass of these 
primitive relay channels which does not overlap with the 
classes previously studied in [1], [2]. In particular, for this 
class of channels, it is assumed that there are two channel 
states, and for each channel state, there is an erasure channel 
from X to Y, We show that the capacity for such channels is 
strictly smaller than the cut-set bound for certain values of Ro- 
This capacity result validates a conjecture due to Ahlswede 
and Han [8] for this class of channels. 

Moreover, we also evaluated our upper bound for a case 
where Y = TX + N, where T, X and N are binary. This 
channel does not fall into any of the classes studied in [1], 
[2] and neither does it fall into the aforementioned class of 
channels. We show that our upper bound strictly improves 
upon the cut-set bound for certain values of Rq, although, 
our upper bound is strictly larger than the rates yielded by a 
potentially suboptimal evaluation of the CAF scheme. 
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